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Abstract 

.  Tills  paper  describes  a  parsing  system  specifically 
designed  for  spoken  rather  than  written  input.  The 
parser  is  part  of  «  project  in  progress  at  Stanford 
Research  Institute  to  develop  a  computer  system  for 
understanding  speech.  The  approach  described  uses  as 
much  heuristic  knowledge  as  possible  in  order  to  mini¬ 
mize  the  demands  on  acoustic  analysis. 

Parsing  Speech 

Parsing  continuous  spoken  English  is  a  signifi¬ 
cantly  different  problem  than  parsing  written  English. 
There  are  at  least  two  major  differences  between  spoken 
and  textual  input. 

First,  spoken  input  is  not  as  easily  decoded  ns 
text  input.  There  are  difficulties  associated  with 
establishing  the  boundaries  of  the  particular  words  in 
the  acoustic  stream,  with  accounting  for  the  variations 
of  a  word  by  different  speakers,  for  different  pronun¬ 
ciations  of  a  word  by  the  same  speaker,  and  for  differ¬ 
ent  pronunciations  of  the  same  word  in  different  con¬ 
texts,  Recognition  of  spoken  input  words  is  much  more 
complex  and  much  less  reliable  than  recognition  of 
written  input  words , 

Second,  speeeh  provides  information  by  prosodic 
features — intonation,  stress,  pause,  juncture. 2  For 
example,  intonation  contours,  the  variations  in  piteh 
and  rhythm  in  an  utterance,  provide  clues  to  the  loca¬ 
tion  of  phrase  boundaries  and  to  syntactic  relation¬ 
ships  of  the  words,  (Work  is  now  being  done,  particu¬ 
larly  at  the  UNI VAC  Corporation3  and  at  the  University 
of  Michigan,4)5  on  how  to  extract  such  information  from 
the  acoustic  signal  and  how  to  incorporate  it  into  a 
grammar  for  the  computer.) 

A  Strategy  for  Parsing  Speeeh 

In  one  approach  to  parsing  text,  the  parser  reads 
the  input  words,  looks  them  up  In  the  lexicon,  and  uses 
their  syntactic  features  for  guidance.  This  approach 
is  much  less  desirable  for  pursing  speech  because  of 
the  problem  of  reliably  separating  and  identifying  the 
input  words  using  current  acoustic  techniques. 

Because  of  this  limitation  of  acoustic  recognition, 
we  have  chosen  to  restrict  the  detailed  acoustie  pro¬ 
cessing  to  testing  words  that  have  been  hypothesized  by 
the  parser  on  the  basis  of  a  wide  range  of  syntactic, 
semantic,  and  prosodic  knowledge.  We  expect  the  shift 
to  verification  of  parser-proposed  words  to  lead  to  more 
reliable  acoustic  decisions  since  the  verification 
algorithms  can  be  tnilormade  for  the  individual  words 
and  can  take  context  into  consideration.  Moreover,  by 


hypothesizing  words  in  order  of  their  likelihood  in  a 
particular  context,  we  should  be  able  to  reduce  the 
average  number  of  incorrect  words  proposed  and,  hence, 
the  potential  for  errors  in  acoustic  recognition. 

An  early  version  of  a  system  partially  using  this 
approach  has  been  put  together  by  making  modifications 
to  Terry  IVinograd's  program.6)’7  Our  new  system  is  not 
directly  based  on  Winograd's  program,  but  it  reflects 
his  influence.  It  also  has  been  influenced  by  the  work 
of  Bill  Woods  and  Rou  Kaplan  on  parsing  systems ,® < 3 >  10 

While  keeping  the  approach  mentioned  above,  the 
new  system  differs  from  the  old  one  in  many  ways.  Sig¬ 
nificantly,  it  uses  a  "best-first"  parsing  strategy  in 
place  of  the  more  conventional  "depth-first”  strategy. 
Both  strategies  are  designed  to  deal  with  "choice 
points”  in  the  grammar--places  where  there  are  several 
alternatives  for  continuing  the  parse  but  not  enough 
Information  available  to  decide  among  them.  With  the 
depth-first  strategy,  choice  points  are  handled  by  using 
"backtracking."  (A  single  path  from  the  choice  point  is 
pursued  until  an  inconsistency  with  the  input  is  found. 
At  that  time,  the  path  is  permanently  abandoned.  The 
parser  is  backed  up  to  the  most  recently  encountered 
choice  point  to  try  the  next  alternative.) 

There  are  several  reasons  why  this  depth-first 
method  is  unattractive  for  parsing  speeeh.  First,  the 
uncertainty  of  acoustic  recognition  makes  it  difficult 
to  decide  conclusively  whether  n  path  should  be  aban¬ 
doned.  With  speech,  it  makes  core  sense  to  deal  with 
the  likelihood  of  a  path  than  with  a  eatcgoricol  accep¬ 
tance  or  rejection  .  A  second  objection  is  that  the 
depth-first  strategy  makes  it  impossible  to  hypothesize 
words  in  order  of  their  likelihood.  Before  the  other 
alternatives  at  a  choice  point  can  be  considered,  all 
possibilities  for  satisfying  the  first  alternative  must 
be  explored.  This  forces  the  testing  of  all  words  cor¬ 
responding  to  the  first  alternative  regardless  of  their 
likelihoods.  A  final  objection  to  the  depth-first 
strategy  is  that  it  explores  many  more  false  paths  than 
a  strategy  using  heuristic  knowledge  to  guide  the  parse. 
Extra  false  paths  are  particularly  bad  for  speech 
beeause  of  the  high  cost  of  the  acoustic  tests  needed 
to  determine  that  a  path  is  a  dead  end  and  the  danger 
of  following  false  paths  further  than  necessary  due  to 
uncertain  acoustic  recognition.  Research  in  Artificial 
Intelligence  has  shown  that  substantial  increases  in 

efficiency  can  result  from  the  application  of  heuristic 

* 

knowledge  to  guide  searches.  Equivalent  gains  should 
be  possible  if  knowledge  is  used  to  guide  the  search  for 
successful  parse, 


*See  for  instance  Nilsson’s  book  and  the  work  referenced 
there  ■  ’ 1 
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These  considerations  nil  support  a  best-first 
parsing  strategy.  In  this  approach,  each  new  path 
resulting  from  a  choice  point  is  assigned  a  priority 
according  to  its  estimated  likelihood  of  leading  to  a 
correct  parse/  The  paths  are  then  added  to  the  set  of 
all  paths  that  have  been  generated  but  not  yet  extended 
during  this  parse.  The  system  follows  the  highest  pri¬ 
ority  path  from  the  comprehensive  set  until  its  priority 
drops  or  it  reaches  a  choice  point.  At  that  time  the 
cycle  repeats.  Since  the  new  path  chosen  need  not  be 
one  of  the  successors  of  the  previous  path,  the  parser 
will  not  necessarily  continue  along  a  single  path  until 
it  reaches  a  dead  end.  Instead,  it  will  suspend  a  path 
when  there  is  an  alternative  available  with  a  higher 
estimated  likelihood.  It  will  resume  the  original  at 
a  later  time  if  it  becomes  most  likely  again. 

The  best-first  method  avoids  the  objections  made 
to  the  depth-first  strategy.  First,  it  is  not  neces¬ 
sary  to  decide  conclusively  that  a  path  is  inappro¬ 
priate,  to  force  a  f,yes  or  no."  The  gradations  of 
confidence  in  acoustic  tests  can  be  reflected  in  a 
range  of  priorities.  Second,  words  can  be  hypothe¬ 
sized  in  order  of  their  likelihood  since  there  are  no 
constraints  imposed  by  the  parsing  strategy  on  the 
order  in  which  paths  are  explored.  And  finally,  the 
number  of  false  paths  explored  can  potentially  be 
reduced  by  using  heuristic  knowledge  to  guide  the 
parse . 

Of  course,  this  potential  for  reducing  false  paths 
can  be  realized  only  if  extensive  knowledge  is  success¬ 
fully  incorporated.  V>'i  no  grad  made  a  noteworthy  step  in 
this  direction  by  using  semantics  to  filter  out  unin¬ 
terpretable  phrases  as  soon  as  they  were  parsed.  But 
more  can  be  done;  knowledge  can  be  used  to  guide  the 
parsing  rather  than  to  act  simply  as  a  passive  filter, 
and  a  wider  range  of  knowledge  can  be  used  .  For 
instance,  in  addition  to  semantics,  the  parser  could  be 
guided  by  such  things  as  prosodies,  statistics  regard¬ 
ing  vocabulary  and  syntactic  constructs,  models  of  the 
user  and  the  dialog. 

Control  Structures 

The  attempt  to  incorporate  more  knowledge  may 
appear  foolhardy  in  the  light  of  the  great  complexity 
of  existing  systems,  Wi  nograd  himself  has  remarked 
that  his  program  was  nearing  the  limits  of  comprehen¬ 
sibility;  our  early  system  was  certainly  no  hotter. 

To  combat  this  problem,  we  have  introduced  control 
structures  to  encourage  the  careful,  systematic  use  of 
heuristic  knowledge  and  aid  clarity  of  the  parser. 

The  control  structures  differentiate  between  the 
priority  functions  that  embody  the  special  knowledge 
and  the  other  parser  functions  that  embody  the  grammar. 
The  grammar  functions  alone  define  the  possible  paths, 
while  the  heuristic  functions  control  the  order  in 
which  the  paths  will  be  explored.  In  this  way  the 
grammar  is  not  obscured  by  the  logic  needed  to  assign 
priorities.  The  division  is  maintained  by  providing 
each  alternative  with  its  own  priority  function.  This 
both  furthers  the  incorporation  of  knowledge  and  sim¬ 
plifies  the  development  of  heuristics,  We  do  not  have 
to  try  to  build  an  omniscient  semantics  module  capable 
of  evaluating  any  configuration;  our  system  will  use 


the  particular  semantic  considerations  in  conjunction 
with  other  relevant  information  specific  to  a  place  in 
the  grammar. 

Control  structures  also  replace  GOTO's  with  func¬ 
tions  that  give  a  clear  and  explicit  form  to  the  stan¬ 
dard  parsing  operations  such  as  listing  alternatives 
and  identifying  optional  elements.  We  agree  with 
Dijkstra  and  others  who  find  GOTO’s  particularly  harm¬ 
ful  to  program  clarity . 12_lw!  Many  other  parsing  sys¬ 
tems  have  treated  the  program  as  a  collection  of  labeled 
blocks  of  instructions  with  control  transferred  arbi¬ 
trarily  by  explicit  reference  to  labels.  This  amounts 
to  the  unconstrained  use  of  GOTO's  whether  the  particu¬ 
lar  syntactic  form  is  a  multidirection  branch  statement, 
a  state  transition,  or  a  production  language  formalism. 
The  elimination  of  GOTO’s  is  probably  the  most  important 
way  in  which  the  control  structures  reduce  the  complex¬ 
ity  of  the  parser. 

Finally,  there  are  advantages  in  developing  the 
control  structures  as  an  extension  to  the  LISP  language. 
These  include  complete  freedom  in  the  use  of  procedures 
for  structuring  the  system,  availability  of  the  control 
and  data  manipulation  facilities  of  LISP,  and  compati¬ 
bility  with  standard  debugging  and  program  development 
aids  . 

Before  describing  the  control  structures  that  have 
been  added  to  LISP,  it  is  useful  to  review  the  overall 
parsing  strategy  and  introduce  some  new  terminology. 

As  the  system  attempts  to  understand  an  input,  it  uses 
the  grammar  to  generate  a  sequence  of  paths.  Corre¬ 
sponding  to  each  path  is  a  "process."  The  likelihood 
of  a  path  is  determined  by  priority  functions  and  is 
reflected  in  the  value  of  the  priority  for  the  process. 
Priorities  are  positive  numbers  whose  values  are  irrel¬ 
evant  except  to  establish  an  order  of  all  the  processes. 
The  highest  priority  process  is  run  until  either  its 
priority  drops  as  a  result  of  some  test  or  it  reaches 
a  choice  point.  In  cither  case,  control  is  trans¬ 
ferred  to  the  highest  priority  member  of  the  list  of 
processes.  Parsing  continues  until  an  acceptable 
interpretation  of  the  input  is  found  or  some  resource 
bound  is  exceeded. 

With  this  strategy  in  mind  we  can  discuss  the 
actual  control  functions.  The  most  basic  of  these  is 
ALT,  which  is  used  to  list  alternatives.  Its  syntax 
is  (ALT  altj  ...altn),  where  each  alt ^  is  of  the  form 
(altname  altpriority  e^  ...  en) .  Altname  is  a  unique 
name  for  this  alternative.  The  use  of  the  name  is 
described  in  the  section  PATHS  and  MAPS.  Altpriority 
is  evaluated  to  determine  the  priority  to  be  given  the 
process  corresponding  to  this  alternative.  Finally, 

Cj  ...  en  specify  the  action  for  this  alternative. 
Actions  can  include  calling  other  control  functions  or 
procedures  that  include  control  functions.  In  other 
words,  there  is  complete  freedom  in  dynamically  nesting 
control  structures.  The  effect  of  the  ALT  is  to  replace 
one  process  by  several  new  ones.  Each  new  process 
independently  can  continue  the  computation  started  by 
the  original  process.  When  one  has  finished  its  action 
...  en,  it  can  immediately  go  on  to  the  statement 
f ol low! ng  t  he  ALT , 
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A  second  control  function  is  OPTION,  which  is 
used  to  identify  optional  components  in  the  grammar. 

Its  form  is  (OPTION  optname  optpriori ties  e^  .  ..  en), 
where  optname  is  a  unique  name  for  this  option, 
optpriorities  evaluates  to  a  pair  of  priorities,  and 
el  * ■ *  en  sPecify  the  optional  action.  The  first  of 
the  priorities  is  assigned  to  a  process  that  will 
execute  e^  . ..  en,  and  the  second  to  a  process  that 
will  simply  go  directly  to  the  successor  of  the  OPTION. 

SEQUENCE  is  similar  to  OPTION.  On  entry  to 
SEQUENCE  the  process  splits  into  two- one  which  exe¬ 
cutes  e^  ...  en,  and  one  which  does  not.  However, 
unlike  OPTION,  the  first  process  reenters  SEQUENCE 
after  completing  en .  This  leads  to  computing  another 
pair  of  priorities  and  spliting  into  one  process 

which  executes  e,  . .  .  e  a  second  time  and  one  which 
1  n 

does  not.  In  this  manner,  the  SEQUENCE  statement  can 

be  reentered  an  arbitrary  number  of  times  to  parse  an 

arbitrarily  long  sequence  of  constituents  satisfying 

e-,  .  .  .  e  . 

t  n 

The  fourth  control  function  is  0PT10NALIF.  Its 
form  is  (OPTIONALIF  condition  name  priorities 

en  ...  e„) .  If  the  condition  is  true  then  e.  ...  e 
^  n  l  n 

are  optional;  otherwise  they  are  required.  The  name 

and  priorities  for  OPTIONALIF  are  like  those  for 
OPTION  and  are  used  only  if  the  condition  is  true. 

The  final  function  is  PARSE,  Used  to  call  the 
program  for  a  grammatical  unit.  Its  syntax  is 
(PARSE  unit  argi  ...  arg  ),  where  unit  is  the  name  of 
the  function  to  be  called  for  a  grammatical  unit  and 
arg-^  ...  argn  are  optional  arguments  for  that  fuuc- 
tion.  The  result  of  PARSE  is  a  parse  tree  for  the 
unit .  PARSE  also  plays  an  important  role  in  tbe 
implementation  as  discussed  below. 

An  Example 

Having  discussed  the  major  elements  of  the  parser, 
we  will  now  present  a  sample  grammar  for  a  noun  phrase 
(Figure  1).  Unfortunately,  an  example  small  enough  to 
present  here  cannot  fully  demonstrate  the  value  of 
this  approach  in  developing  a  large  system.  But  this 
example  can  illustrate  the  format  of  a  grammar.  List¬ 
ings  of  a  much  more  complete  grammar  are  available 
from  the  authors. 

The  example  grammar  is  a  LISP  program  called 
NOUNGROUP.  The  first  statement  in  it  is  an  ALT.  In 
this  example,  the  first  alternative  is 

(ARTICLE  (ARTPRESENT) 

OVDTYPE  ARTICLE)) 

Alt name  is  "ARTICLE, "  altpriority  is  computed  by 
tile  functiou  ARTPRESENT,  and  the  action  for  this 
alternative  is  the  call  to  WD TYPE .  The  function  WDTYPE 
finds  in  the  input  a  word  of  the  category  specified  by 
its  argument. 

Ill  this  example,  alt^  looks  for  an  initial  article 
in  the  noun  group.  Altg  through  alt^  look  for  a 
demonstrative  adjective,  quantifier,  pronoun  and  thing 
pronoun  respectively.  The  last  alternative,  altg 
(labeled  NULL)  allows  the  noun  group  to  start  directly 


with  an  adjective  or  noun.  The  statement  following 
tile  ALT  controls  the  parsing  of  optional  adjectives 
and  the  head  noun.  These  are  not  allowed  after  a 
THINGPRON,  are  optional  after  either  a  BEM0NSTRAT1 VEABJ 
or  a  QUANTIFIER,  and  are  required  in  all  other  cases . 
This  is  represented  in  the  program  by  an  OPTIONALIF 
inside  a  COND.  The  final  statement  of  NOUNGROUP  looks 
for  modifying  relative  clauses  or  prepositional  phrases 

This  grammar  will  find  constructions  such  as: 

"the  big  green  table” 

"that  part" 
some  narrow  pieces" 

"something” 

"everything  green” 

"he, "  "it" 

All  of  these  except  the  pronouns  can  be  followed 
by  a  sequence  of  modifying  phrases.  Relative  clauses 
such  as  "which  you  dropped"  and  "that  was  picked  up" 
or  prepositional  phrases  such  as  "on  the  floor"  and 
"under  the  table"  are  modifying  phrases. 

Word  Verifier 

Another  important  part  of  the  parsing  system  is 
the  word  verifier.  This  component  tests  acoustic  data 
for  the  presence  of  words  from  such  terminal  categories 
as  noun,  verb,  and  adjective  that  are  predicted  by  the 
parser  according  to  some  path  through  the  grammar.  The 
words  in  the  terminal  category  can  be  thought  of  as 
alternatives,  exactly  like  the  higher  syntactic  alter¬ 
natives  in  the  grammor,  that  should  be  considered  in 
order  of  their  likelihood.  The  same  structure  of  alter¬ 
natives  and  priorities  is  here  found  on  the  level  of 
word  verification.  Estimated  likelihoods  con  not  only 
order  the  words  in  the  terminal  category,  but  also 
defer  acoustic  processing  as  long  as  higher  priority 
alternatives  remain  to  be  explored. 

The  word  verifier  must  first  establish  a  priority 
for  each  candidate  word  and  then,  according  to  those 
priorities,  schedule  acoustic  verification.  Correspon¬ 
ding  to  these  two  operations  there  is  a  priority  func¬ 
tion  and  a  verification  function  associated  with  each 
Word  in  the  vocabulary.  The  tests  made  by  the  priority 
function  are  computationally  simple,  although  broad  in 
scope.  Tile  results  of  preliminary  acoustic  processing 
arc  consulted  to  ensure  that  the  word  is  at  least  feas¬ 
ible.  The  frequency  of  occurrence  of  the  word  and  the 
likelihood  of  co-occurrence  of  the  word  with  recently 
used  words  are  considered.  Finally,  the  semantic 
features  of  the  word  are  compared  to  the  semantic 
restrictions  of  the  context. 

In  contrast  to  the  quick  tests  made  by  the  pri¬ 
ority  function,  the  verification  function  may  perform 
extensive  aconstic  analyses  to  determine  how  well  the 
proposed  word  matches  the  next  portion  of  input.  The 
details  of  this  processing  are  beyond  the  scope  of  this 
paper  but  have  been  sketched  elsewhere.7  Since  the 
verification  function  depends  only  on  acoustic  informa¬ 
tion,  its  results  are  saved  so  that,  if  another  process 
looks  for  the  same  word  in  the  same  place,  the  prior 
results  can  be  Used.  Of  eoursc,  the  results  of  the 
priority  functions  cannot  be  reused  in  this  manner 
since  they  are  extremely  dependent  on  context. 
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(NOUNGROUP 
(LAMBDA  NIL 
(PROG  NIL 

(ALT  (ARTICLE  (ARTPRESENT) 

(WDTYPE  ARTICLE)  (*  A  AN  THE) 

) 

(DEMONSTRATI VEADJ  (DEHADJPRESENT ) 

(WDTYPE  DEMONSTRATI VEADJ) 

(*  THIS  THAT  THESE 
THOSE) 


) 

(QUANTIFIER  (QNTFRPRESENT) 

(WDTYPE  QUANTIFIER) 

(*  SOME  NONE  NO  NEITHER 
MOST  MANY  EVERY  EITHER 
EACH  BOTH  ANY  ALL) 

) 

(PRONOUN  (PRONPRESENT) 

(WDTYPE  PRONOUN) 

(RETURN)) 

(THINGPRONOUN  (TPRONPRESENT) 

(WDTYPE  THINGPRONOUN) 

(*  SOMETHING  NOTHING 
EVERYTHING  ANYTHING) 
(OPTION  THINGADJ  (ADJAFTERTPRON ) 

(WDTYPE  ADJECTIVE)) 

(*  'SOMETHING  RED') 


(NULL  (NOART) 
)) 


(*  NOUNGROUP  STARTS  WITH 
ADJECTIVE  OR  NOUN) 


(COND 

(THINGPRONOUN  NIL  (*  ADJECTIVE-NOUN  NOT 

ALLOWED  AFTER 
THINGPRONOUN)) 

(T  (OPTIONALIF  (OR  DEMONSTRATI VEADJ  QUANTIFIER) 

ADJNOUN 

(ADJECTI VENOUNPRESENT) 

(*  AFTER  DEMONSTRATIVE 
ADJECTIVES  AND 
QUANTIFIERS 
ADJECTIVE-NOUN  ARE 
OPTIONAL) 

(ADJNOUN)  (*  PARSES  OPTIONAL 

ADJECTIVES  AND  HEAD 
NOUN) 

))) 

(SEQUENCE  ENDINGS  (ENDING) 

(ALT  (CLAUSE  (CLAUSEAFTER) 

(PARSE  CLAUSE  RELATIVE)) 

(PREPOSITION  (PREPAFTER) 

(PARSE  PREPGROUP ))))))) 


Figure  1 


PATHS  and  MAPS 

In  the  word  verifier,  we  have  seen  how  one  pro¬ 
cess  can  use  information  gained  by  another;  the 
results  of  a  verification  function  can  be  used  by  a 
subsequent  process  looking  for  the  same  word.  The 
next  step  is  to  share  the  syntactic  structures  that 
are  found.  But  the  processes  may  have  had  different 
contextual  constraints  and  may  need  to  make  different 
tests,  problems  for  direct  sharing.  We  have  added 
PATHS  and  MAPS  to  the  parsing  system  to  facilitate 
sharing  and  to  allow  for  additional  tests.* 


A  path  describes  the  flow  of  control  through  the 
grammar  leading  to  the  current  state  of  a  process. 

For  example,  at  ALT  statements  the  name  of  the  alter¬ 
native  taken  is  added  to  the  path,  and  when  a  word  is 
recognized  in  the  input  it  also  is  added.  The  actual 
structure  of  the  path  is  a  list  of  names  and  words  with 
the  most  recent  history  of  the  process  at  the  front  of 
the  list.  A  map  is  the  part  of  the  path  that  was 
followed  by  a  process  when  it  successfully  parsed  some 
grammatical  unit.  The  function  PARSE  automatically 
records  the  map  before  it  returns,  and  it  checks  for  a 
map  when  it  is  called  . 


* 


This  is  closely  related  to  the  well  formed  sub¬ 
string  facility  described  by  Woods.2 
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The  control  functions  (ALT,  OPTION',  OPTIONAL1F, 
and  SEQUENCE)  consult  the  map  and  raise  the  priority 
of  the  named  alternative.  The  map  thus  helps  to  guide 
the  process  which  is  parsing  a  constituent  by  con¬ 
tributing  data  on  alternatives  successful  in  prior 
attempts . 

Nevertheless,  it  is  still  possible  that  the  map 
will  fail  due  to  differences  in  the  context.  When  a 
proeess  is  forced  off  the  path  specified  by  a  map,  it 
simply  reverts  to  the  standard  mode  of  parsing.  This 
ensures  that  the  use  of  maps  will  not  result  in  the 
acceptance  of  a  previously  parsed  constituent  that 
fails  to  meet  the  eurreut  constraints.  The  map  merely 
serves  as  heuristic  knowledge  to  guide  the  reparse  and 
cau  be  overridden  by  other  considerations. 


Finally,  the  map  must  take  into  account  the  fact 
that  there  may  be  more  than  oue  successful  parse  of, 
say,  a  noun  group  starting  at  a  particular  location 
in  the  input.  In  its  most  general  form  a  map  is  a 
tree  structure  with  each  path  through  the  tree  corre¬ 
sponding  to  a  successful  parse.  If  a  branch  point  is 
reached  while  following  a  map,  the  several  alternatives 
will  be  given  raised  priorities. 

It  is  worth  noting  a  second  use  for  the  path  as  a 
debugging  tool.  The  parsing  strategy  causes  control 
to  move  among  a  large  number  of  active  processes,  and 
the  computation  leading  to  the  current  state  of  a  pro¬ 
cess  will  have  been  interlaced  with  the  activity  of 
other  processes.  Standard  tracing  techniques  are  less 
useful  for  debugging,  and  the  path  is  a  valuable  aid 
in  determining  how  a  process  has  reached  its  current 
state. 


Implementation 


The  parser  will  be  implemented  in  BBN-LlSPlc 
using  the  multiple  environment  control  structure 
facility  described  by  Bobrow  and  Wcgbreit . lb  Among 
other  things,  this  facility  generalizes  the  standard 
linear  stack  to  a  tree  structure.  Figure  2  intro¬ 
duces  some  terminology  for  discussing  such  a  tree. 


Non-Terminal  Section 
Branch  Point 
Terminal  Section 
Tip 


Tile  root  corresponds  to  the  base  of  an  ordinary 
stack,  and  each  tip  hehaves  like  the  top  of  an  ordin¬ 
ary  stack.  Jn  the  parser  there  is  a  one-to-one  cor¬ 
respondence  between  the  tips  of  the  stack  structure 
and  processes.  When  a  process  calls  a  function,  the 
variables  for  the  function  are  added  to  the  tip  for 
that  process,  and  when  the  function  returns,  the  var¬ 
iables  are  removed.  The  path  from  the  root  to  the 
tip  contains  all  the  variables  for  the  process.  At 
branch  points  the  paths  for  several  processes  join. 
Thus  the  variables  in  the  path  from  the  root  to  the 


brouch  point  are  shared  by  more  than  one  process.  Only 
the  variables  in  terminal  sections  can  be  modified  by 
the  process  without  interfering  with  other  processes; 
the  variables  in  nonterminal  sections  must  not  be 
changed.  This  presents  a  problem  when  the  process 
wishes  to  return  from  the  first  (nearest  the  root) 
function  in  the  terminal  section  to  the  last  function 
in  the  last  nonterminal  section.  The  process  cannot 
use  that  section  beeause  it  is  shared.  The  solution  is 
to  make  a  copy  of  it  that  will  have  the  same  relation 
to  other  sections  of  the  path  as  the  shared  seetion  but 
will  be  for  private  use  of  the  process.  To  ensure  that 
this  copying  is  done  properly,  eaeh  stack  seetion  begins 
with  a  call  of  PARSE,  and  instead  of  returning  in  the 
normal  manner,  PARSE  makes  a  eopy  of  the  next  section 
of  the  tree  and  transfers  control  to  it.  This  provides 
a  new  terminal  section  for  the  process  and  returns  con¬ 
trol  to  the  function  that  called  PARSE  originally. 

Figure  3  shows  the  effect  on  the  stack  structure  of 
such  a  return  by  PARSE. 


In  (a),  process  p  is  running  in  terminal  stack  sec¬ 
tion  B.  In  (b),  p  has  ’'returned"  from  B  to  ,  a 
copy  of  A,  and  B  has  been  deleted. 

Figure  3 


Copying  is  also  necessary  when  new  processes  are 

created.  Figure  4  shows  a  portion  of  the  stack  before 

♦  . 

and  after  an  ALT  statement.  When  a  process  is  killed 
its  terminal  stack  section  is  deleted. 


(a)  (b) 

In  (a),  p,  running  in  Section  A,  is  about  to  reach  an 
ALT  statement.  In  (b),  p  has  been  replaced  by  processes 
Pi  tln'ough  pn,  each  with  a  copy  of  A,  corresponding  to 
tlie  n  alternatives  of  the  ALT. 


Figure  4 


This  is  not  quite  true.  In  the  actual  system  the  copy 
Aj  is  not  made  until  p.^  becomes  the  highest  priority 
process.  Thus  all  the  new  processes  which  have  not  yet 
been  activated  share  the  same  terminal  seetion. 
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There  are  two  main  restrictions  on  the  parser 
caused  by  this  implementation.  First,  data  structures 
which  are  not  intended  to  be  global  to  all  processes 
cannot  be  changed  except  after  copying.  This  is  to 
avoid  interference  between  processes  which  may  be 
sharing  the  same  structures.  For  example,  if  process 
p  in  Figure  4a,  has  a  variable  X  whose  value  is  a  list, 
then  processes  pi  through  pn  in  Figure  4b  will  all  have 
variables  pointing  to  the  same  list  (in  LISP  termin¬ 
ology,  the  X’s  will  be  EQ) ,  To  change  the  value  of  X, 
process  p^  must  store  a  new  pointer  in  X  rather  than 
modify  the  original  structure  pointed  to  by  X.  While 
this  restriction  has  affected  the  design  of  the  parser, 
it  has  not  been  a  serious  problem. 

The  second  main  restriction  on  the  parser,  that 
only  variables  within  the  terminal  section  of  the 
staek  can  be  changed,  did  present  a  problem.  Certain 
variables  such  as  the  current  position  in  the  input 
are  private  to  each  process  but  global  within  the 
process.  The  solution  is  to  identify  such  variables 
to  the  system  as  ’process  globals."  On  entry  to  PARSE, 
the  process  globals  are  rebound  to  their  old  values. 
Thus  they  are  within  the  terminal  section  and  can  be 
given  new*  values.  Before  PARSE  exits,  it  propagates 
the  proeess  globals  back  up  the  stack.  For  example, 
in  going  from  Figure  3a  to  Figure  3b,  the  process 
globals  for  p  are  propagated  from  section  B  to  section 
.  By  generalizing  this  scheme,  variables  can  appear 
to  be  bound  at  any  level.  Thus  we  can  eliminate  the 
second  restriction  on  the  parser  by  identifying  certain 
special  variables  to  the  system. 

The  implementation  has  the  prime  advantage  of 
avoiding  a  special  interpreter  for  the  parsing  language. 
Since  the  "language”  consists  of  additional  LISP  func- 
tions,  the  entire  parser  con  be  compiled  with  the 
standard  LISP  compiler  and  debugged  with  the  standard 
LISP  debugger.  This  means  faster  exeeution  and  easier 
debugging.  Of  course,  the  biggest  improvement  in 
efficiency  relies  on  knowledge  to  guide  the  parser, 
but  compilation  will  give  a  substantial  speed  up  in 
addition . 

Conclusion 

Tlie  system  described  in  this  paper  is  part  of  an 
ongoing  research  project  in  speech  understanding  and 
is  intended  to  provide  a  solid  basis  for  continuing 
work .  There  are  many  other  aspects  to  speech  under¬ 
standing  besides  the  development  of  a  parser  system. 

It  has  been  to  our  benefit  to  be  part  of  a  speech  pro¬ 
ject  at  SRI  that  includes  personnel  interested  in  a 
variety  of  tasks  from  acoustie  signal  processing 
through  semantic  representation  to  overall  system 
organization . 

A  sizeable  grammar  for  English  has  been  written 
using  this  parsing  system.  Although  it  is  difficult 
to  characterize  the  scope  of  a  grammar,  it  appears  to 
be  as  extensive  as  others  we  are  familiar  with  (in 
particular  those  of  Winograd6  and  Woods,  et  al.9). 

Our  current  efforts  center  on  acoustic  processing 


* 

These  functions  of  course  depend  on  the  multiple 
environment  control  facility  in  BBN-L1SP . 


routines  for  use  in  word  verification,  priority  func¬ 
tions  for  grammatical  alternatives,  and  semantic  model¬ 
ing  of  the  domain  of  discourse. 
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